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Abstract 

We construct a two-sample test for comparison of long memory parameters based 
on ratios of two rescaled variance (V/S) statistics studied in [Giraitis L., Leipus, R., 
Philippe, A., 2006. A test for stationarity versus trends and unit roots for a wide 
class of dependent errors. Econometric Theory 21, 989-1029]. The two samples have 
the same length and can be mutually independent or dependent. In the latter case, 
the test statistic is modified to make it asymptotically free of the long-run correlation 
coefficient between the samples. To diminish the sensitivity of the test on the choice of 
the bandwidth parameter, an adaptive formula for the bandwidth parameter is derived 
using the asymptotic expansion in [Abadir, K., Distaso, W., Giraitis, L., 2009. Two 
estimators of the long-run variance: Beyond short memory. Journal of Econometrics 
150, 56-70]. A simulation study shows that the above choice of bandwidth leads to a 
good size of our comparison test for most values of fractional and ARMA parameters 
of the simulated series. 

Keywords: Long memory, two-sample test for comparison of memory parameters, long- 
run covariance, rescaled variance, bandwidth choice 



1 Introduction 

Long memory is one of the most widely discussed "stylized facts" of financial time series 
(see, e.g., Teyssiere and Kirman (2007)). In real data, long memory can be confused with 
short memory, unit roots, trends, structural changes, heavy tails and other features. Various 
tests for long memory have been developed in the literature. See Lo (1991), Kwiatkowski 
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et al. (1992), Robinson (1994), Lobato and Robinson (1998), Giraitis et al. (2003, 2006), 
Surgailis et al. (2008). Most of these results pertain to the case of a single sample. 

A natural extension of one-sample test about unknown long memory parameter d is two- 
sample testing for comparison of respective memory parameters d\ and d<i- In particular, 
such test can be useful for the memory propagation (from durations to counts and realized 
volatility), question discussed in Deo et al. (2009). Several studies compare estimates of long 
memory parameter from different foreign exchange data and other sources (Cheung (1993), 
Soon et al. (2006), Casas and Gao (2008)). Two-sample testing is also related to the 
change-point problem of the memory parameter discussed in Beran and Terrin (1996), 
Horvath (2001). 

The present paper constructs a test for testing the null hypothesis d\ = d2 that long 
memory parameters di £ [0, 1/2) of two samples of length n, taken from respective stationary 
processes Xj, i = 1,2, are equal, against the alternative d\ ^ di- The test statistic, T n , is 
defined as a sum 

of two ratios of V/S, or rescaled variance, statistics V\/ Su A and V2I ' $22,q computed from 
samples (Xi(l), . . . ,Xi(n)) and (A^l), . . . ,X2(n)). Here, Vi is the empirical variance of 
partial sums of Xj and Sa g is the Newey-West or HAC estimator of the long-run variance 
of Aj. The V/S statistic was developed in Giraitis et al. (2003, 2006) following the works of 
Lo (1991) and Kwiatkowski et al. (1992) on related R/S type statistics. In particular, from 
Giraitis et al. (2006) one easily derives the asymptotic null distribution T of the statistic 
T n under the condition that the two samples are independent. It is also easy to show that 
for d\ 7^ g?2, one of the ratios in (II. lj) tends to infinity and the other one to zero, meaning 
that the test is consistent against the alternative d\ 7^ cfo. 

However, independence of the two samples is too restrictive and may be unrealistic in 
financial data analysis since price movements of different assets are usually correlated and 
susceptible to common macroeconomic shocks. In order to eliminate the eventual depen- 
dence between samples, a modification T n of (jl.ip is proposed, which uses residual observa- 
tions (Ai(l), . . . , Ai(n)), obtained by regressing partial sums of Ai on partial sums of A2. 
The modified statistic T n is shown to have the same limit null distribution T as if the two 
samples were independent. 

It is well-known that a major problem in applications of the rescaled variance and related 
statistics is the choice of the bandwidth parameter q. The present paper contributes to this 
problem by providing an adaptive formula in (|6.32|) for "optimal" q which depends not only 
on the (common) memory parameter d but also on the difference between estimated short 
memory (AR) components of the spectrum of the sampled series. The derivation of the last 
result uses the expansion of the HAC estimator in Abadir et al. (2009). A simulation study 
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confirms that using this choice of bandwidth leads to a good size of our comparison tests 
for most values of fractional and ARM A parameters of the simulated series. 

Several interesting open problems were suggested by Referee and Associated Editor. The 
assumption of stationarity can be very restrictive for applications. We expect that our 
results can be extended to values of di outside the interval [0, 1/2). Another possibility for 
future research is development of similar procedures to test equal memory for more than 
two series. 

The plan of the paper is as follows. Section [2] formulates the settings of the paper (As- 
sumptions A(di, c?2) and B(dj, d®)) and derives the limit of the test statistics T n and T n and 
the rejection region of the null hypothesis d\ = cfo. Assumption K(d\,d2) guarantees the 
existence of long-run (co)variances and the consistency of the HAC estimators. Assumption 
B(di, d-z) specifies the joint limit behavior of partial sums of X\ and Xi as given by bivari- 
ate fractional Brownian motion. The last process is defined by means of stochastic integral 
representation as in Chung (2002). The test procedures are then presented in detail and a 
brief study focus on the asymptotic power of the tests T n and T n . Section [3] verifies Assump- 
tions A(di,d2) and B(d\,d2) for bivariate moving average {X\,X2). Section 0] assesses the 
performance of the tests T n and T ra , by simulating bivariate FARIMA samples with various 
fractional and autoregressive/moving average parameters. Conclusions are given in Section 
[5j The Appendix contains auxiliary results and derivations. 

Notation. Below, — > p , — ^ aw , — ^_d[o,i] an d — >idd (=fdd) stand for the convergence in proba- 
bility, the weak convergence of random variables, the weak convergence of random elements 
in the Skorohod space D[Q, 1], and the weak convergence (equality) of finite dimensional 
distributions, respectively. Relation '~' means that the ratio of both sides tends to 1. 

2 Construction of tests and its properties 
2.1 Assumptions and main results 

Let ((Xi(t), X2(t)), t £ Z) be a bivariate covariance stationary process, viz., a sequence of 
random vectors {X\(t) , X2{t)) S M 2 such that EAj(i) = \i\ and 



do not depend on t £ Z for any h £ = 1,2. Introduce the popular Bartlett-kernel 

estimator of the long-run (co)variance: 



cov(Xi(t),Xj{t + h)) =-iij{h) 




(2.1) 
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where 



( n—h 



n < 



^(XiW-XiXXjit + ty-Xj), h>0, 



t=l 

n 



(2.2) 



]T (Xi(t) - Xi^Xjit + h)- Xj), h<o, 

k t=l-h 



Xi = n 1 X^™=i^(0- The estimator in (12. ip is also called the heteroskedasticity and 
autocorrelation consistent (HAC). Also define 



C*0 



El' 

h=—q 



\h\ 
9+1 



(2.3) 



where 



/ n-h 



J2(Xi(t)- mXXjit + h) - fxj), h>0, 
t=l 

n 

(Xi(t) - fii)(Xj(t + h)- iij), h<0. 



(2.4) 



V t=l-h 

Note %j{h) = jji(-h), 7° (/i) = %{-h), S 12 , q = S 2 i, q , S^ 2 q = S^ q . 

Assumption A(di,d 2 ) There exist d{ G [0, 1/2), i = 1,2 such that for any i,j = 1,2 the 
following limits exist 



n— >oo ji 



^(j2(Mt)-^)) (fix, 
i n 



(s) - ^) 



(2.5) 



t,s=l 



Moreover, 



Y iij{k - 1) 



kl=l 



Y Hj(k - I) 

k,l=l 



(2.6) 



as q — > oo, n — > oo,n/q — > oo. 

Remark 2.1 The asymptotic constant is called the long-run covariance of Xi and Xj. 
Condition (|2,6p is similar to Giraitis et al. (2006, Assumption A. 2). It guarantees the 
consistency of the HAC estimator (see below) . 
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Proposition 2.2 Let Assumption A(d\,d2) hold. Then, as q —> oo,n — > oo, n/q — > oo, 

q- di - d iS iM -> p dj, q- d ^S° lhq ^ p dj (ij = 1,2). (2.7) 

Moreover, 

\{S im -S^ q ) = -(X, - Mi )(l + Qp(l)). (2.8) 

Assumption B(di,o?2), below, specifies the joint limit of partial sums of X\ and X2. It 
is similar to Giraitis et al. (2006, Assumption A.l). The limit process (bivariate fractional 
Brownian motion) is defined through a stochastic integral representation (|2.9p similarly as 
in Chung (2002, (6)). Equivalently, this process can be defined through the covariance 
function defined in (16.11). 



Definition 2.3 A nonanticipative bivariate fractional Brownian motion (bi-fBm) with mem- 
ory parameters di £ (—1/2, 1/2), i = 1,2, is a Gaussian process B = ((B\(s), B%{s)), s£K) 
admitting the following representation for i = 1,2 

BM) = [ c ^ ((* ~ x )+ ~ W ^ dx )' d i + °> (2 9) 

[Wi(0,t], ' ifdi = f 

W = ((Wi(cLe), W^cLc)), 1 £ I) is a 2— dimensional Gaussian independently scattered 
white noise with real components, zero mean and covariance matrix 

EW i (dx)W j (dx) = dx\ ' *~ J ' , (2.10) 

{Pw, l T3 

for some pw £ [ — 1, 1] , and the constants c(di) are determined by condition EBf(l) = 1 so 
that 

^-(/(a-tf-^N-'- ewT^Vi) - 

where ~B(p,q) is the beta function and x+ = max(x,0). 

Remark 2.4 The nonanticipative bi-fBm is a particular case of general bi-fBm having the 
stochastic representation 

X(t) = [ {((t-x)°-(-x)°)A + + ((t-x)°-(-x)°)A„}W(dx), 

JR 

where D = diag(<ii, cfe), x - = max(— x,0), A + ,A_ are real 2x2 matrices and W(dx) = 
(W\[dx) ,W2{dx)) , x G M, is an independently scattered Gaussian white noise with zero 
mean, unit variance and independent components; see Didier and Pipiras (2010), also 
Lavancier et al. (2009, (1.6)). The representation in (|2.9|) corresponds to the matrices 
A A * - ( < d ^ 2 <*M<k)p\ , n 

+ + " \c{dMd2) P C{d 2 ? ' ' • 
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Remark 2.5 In the sequel by bi-fBm we mean the nonanticipative process in (|2.9p . A 
bi-fBm has stationary increments and the self-similarity property: 

(A- dl - 5 B 1 (At),A- da - 6 B 2 (At)) = fdd (B 1 (t),B 2 (t)) 

for any A > 0. Lavancier et al. (2009) showed that these properties essentially determine 
the covariance function in (|6.ip - (|6.3p . up to a choice of constants gij and gi defined in (|6.2p 
and (|6.4p (see Section [fTT]) . 

Note that each component Bi is a univariate fractional Brownian motion with variance 
EBf(t) = \t\ 2di+1 , % = 1,2. Also note that (-Bi(l), B 2 {1)) has a bivariate Gaussian distri- 
bution with zero means, unit variances and the correlation coefficient p = ~EB\(1)B 2 (1) = 
pyyK^di, (I2), where n{d\,d 2 ) depends only on d\,d 2 . In the case d\ = d 2 , we have that 
P = Pw and the process {B\{t) = B\{t) — pB 2 (t),t G M) is a fractional Brownian motion 
with variance F*Bf(t) = (1 — p 2 )\t\ 2dl+1 . Moreover, the processes B\ and B 2 are indepen- 
dent. Indeed, from (|2.9|) it is immediate that B\ has a similar stochastic representation 
with W\(dx) replaced by Wi(dx) = W\(dx) — pW 2 (dx), with W\ independent of W 2 . 



Assumption H(di,d 2 ) Assumption A(d\,d 2 ) is satisfied and, moreover, 



n- dl - 



,-(1/2) ^(X l( t) - EA^)),n^-(V 2 ) £™(X a (t) - EX 2 (t))) 
^fdd (^i(r),^ 2 (r)), (2.11) 

where Cjj are the same as in (|2.5p . Cjj > 0, i = 1,2 and (B\, B 2 ) is a bi-fBm with memory 
parameters d\,d 2 and the correlation coefficient p = corr(l?i(l), B 2 (l)) = c\ 2 j ^/c\\c 22 G 
[-1,1]- 

Define the empirical variance of partial sums of Xi : 

n I k \ 2 / n k \ 2 

V * = n- 2 £ ^(X^-X.) -n" 3 . (2.12) 

fc=i \t=i / \fc=i t=i / 

The following proposition obtains the limit distribution of the statistic T n defined in (jl.ip . 

Proposition 2.6 Xei Assumptions A(d\,d 2 ) and B(d\,d 2 ) be satisfied with some d\,d 2 G 
[0, 1/2) and p G [—1, 1], and let n, q,n/q — )• 00. 

(i) If d\ = d 2 = d then 

U\ U 2 , 
T n ^i aw T = j^ + jf, (2.13) 
U 2 U\ 

where 

Ut = l\Bf(r)) 2 dT-(j\f(r)d^j , , = 1,2, (2.14) 
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where (Bf(r) = Bi(r) — rBi(l) , r E [0, 1]), i = 1,2 are fractional Brownian bridges obtained 
from bivariate fBm ((B\{t), B 2 (t)),t E R) mi/i i/ie same memory parameters d\ = d 2 = d 
and correlation coefficient p = pw (see Definition \2.3\) . 

(ii) If d\ / c?2 ^en 

T n -+ p oo. (2.15) 

Let Vi,5n j9 be the statistics in (|2.12p . (|2.ip . respectively, where = 1, ■■■,n is 

replaced by 

X^t) = X 1 (t)-(S 12 , q /S 22>q )X 2 (t), t = l,...,n. (2.16) 

In particular, note 

<Sll,g = S'll.g-TH^- ( 2 - 17 ) 
J22,q 

Define 

" - T7T? h 7775 — • ^ ^ 

1/2/^22,9 Vi/5n, g 

Note, T n is obtained by replacing V\, S\\ s in the definition of T n in (jl.ip by the correspond- 
ing quantities Vi,Su iq as defined above. 

In the following proposition, we prove that under the null hypothesis, the limit distribu- 
tion of T n is free of p, contrary to T in (12. 14f> . Note that the limit of T n coincides with 
(|2.14p when p = 0. This occurs for example when the statistics T n is calculated from two 
independent processes X\, X 2 . 

Proposition 2.7 Let Assumptions A(d\,d 2 ) and B(d±,d 2 ) be satisfied with some d\,d 2 E 
[0, 1/2) and p E (—1, 1), and let n, q,n/q —> oo. 

(i) If d\ = d 2 = d then 

f n "Maw f = ^ + ^, (2.19) 

where Ui,U 2 are independent and have the same distribution in {2.11$ . 

(ii) If di > d 2 then 

f n ^ p oo. (2.20) 



(Hi) If d\ < d 2 then 



P 2 , I-P 2 



P -I 2 ' 2 

1 — fT p £ 



+ ^~. (2.21) 
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Remark 2.8 The ratio (3 = S\ 2s /S 22:q m f)2. 16j) minimizes the sum of squares: 

n (i+q)/\n (i+q)An 
i=l-5 t=iVl t=iVl 

Therefore, p = fiyj iS^q) 'Su,q can be considered as the least squares estimate of the long-run 
correlation coefficient p between partial sums of the two samples. 

2.2 Testing procedures 

Let t a (d) denote the upper a— quantile of the r.v. T defined in (|2,19p (or T in (|2,14p when 
p = 0), viz. 

a = P(f > t a (d)), a 6 (0,1). (2.22) 

Let 

d=(di + d 2 )/2, (2.23) 
where d{ is an estimator of di satisfying 

dk-ck = Op(l/logn) (* = 1,2). (2.24) 

Similarly to Giraitis et al. (2006, Lemma 2.1), it can be proved that the quantile function 
t a (d) is continuous in d G [0,1/2) for any a E (0,1). Therefore, the estimated quantile 
t a (d) — > p t a (d) as n — > oo and the asymptotic level of the tests associated to the critical 
regions in (12.25j) - (j2.26H is preserved by replacing t a (d) by t a (d). 

Testing the equality of the memory parameters in the case of independent samples. We wish 
to test the null hypothesis d\ = d 2 against the alternative d\ ^ d 2 under the assumption that 
Xi and X 2 are independent. The decision rule at a— level of significance is the following: 
we reject the null hypothesis when 

T n > t a (d). (2.25) 
The consistency of this test is ensured by Proposition 12.61 (ii). 

Testing the equality of the memory parameters in the case of possibly dependent samples. 
We wish to test the null hypothesis d\ = d 2 against the alternative d\ > d 2 in the general 
case when X\ and X 2 are possibly dependent. The decision rule at a— level of significance 
is the following: we reject the null hypothesis when 

f n > t a (d). (2.26) 

The consistency of this test is ensured by Proposition 12.71 (ii). 
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Remark 2.9 For testing d\ = di against d\ < c?2, the samples X\ and X2 should be 
exchanged in the statistic (|2.18p . 



Remark 2.10 As noted by the referee, an undesirable feature of the testing procedure in 
(]2.26p is that a rejection might occur not only when d\ > di but also when d\ < di (see 
Proposition 12.71 (hi)). To alleviate this feature, in (|2.26p one can use the statistic 



Vi/S u , q 

V2/ 5 , 22, 9 



instead of T n . Note T n = T+ + (T+) _1 . Under the assumptions of Proposition [2T71 the limit 
distribution of can easily be obtained from the proof of this proposition. In particular, 
if d\ = d% = d, then 

f+ -H f + = — 
J-n 'law jj 1 

where U\ , U2 are independent and have the same distribution in (|2.14p . From the proof of 
Proposition ^. 7\ it also follows easily that — > p 00 (di > cfo) and — > p p 2 /(l — p 2 ) (di < 
d2), i.e., does not explode to infinity for d\ < cfe and |p| not very close to 1. 

2.3 Asymptotic behavior of the power function 

We discuss in this section the asymptotic power of the testing procedures in (12.25j) and 
(f^26D . 

For testing d\ = c?2 against d\ > c?2, we have proved that the rejection probability of the 
null hypothesis tends to 1, i.e., that 

P(f n > t a (d)) -> 1, (2.27) 

for any a G (0, 1). 

Section 0] provides finite sample rejection frequencies of the null hypothesis for some 
choices of parameters d\,d2 and some bivariate FARIMA models. A natural question in this 
context is to estimate the convergence rate in (|2.27|) . or the decay rate of the probability 
P(Tn < a), as a function of a,n,d±,d2, p and (possibly) other quantities of the model 
assumptions. 

From the proof of Proposition 12.71 (ii), see (|6.14p below, we have that for d\ > c?2, the 
normalized statistic T n has a limit distribution, 

( q /nf^)f n = { q /nf^^l^L+o p (l) ^ law U \ =A, (2.28) 

V-2/S-22,q {l-p^)U 2 

say, where Ui,i = 1,2 are defined in (|2.14p . Therefore, we can expect that the probability 
~P(T n < a) decays as the probability P(A < a(q/n) 2 ^ dl ~ d2 ^), when n — > 00. The decay 
rate of P(A < x), x — > is unknown, even for independent U%, U2, but in principle can be 
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estimated from Monte-Carlo experiments. It is also plausible that r.v. A has a bounded 
probability density near x = and so the above discussion suggests a decay rate P(T n < 
a) = 0{{q/n) 2 ^~ d ^). 

However, the above argument is heuristic; in particularly, the replacement of T n by 
(q/n) 2 ( d2 ~ dl ^ A is not rigorously justified. It is clear that in order to correctly assess the 
probability P(T n < a), it is necessary to control from above the probabilities of high values 
of V\ and S22 a in the denominator of the ratio T / / Q 11,q = ^ ~? 2 ' q and the probabilities of 
small values of V2, <Sii )9 in the numerator of the last ratio. While the former probabilities 
can be controlled by the Markov inequality, direct estimation of the latter probabilities is 
difficult and is replaced by an assumption on the distribution functions of these r.v.'s, see 
(|2T29]) below. 

Proposition 2.11 Let Assumption A{d\,d2) be satisfied with < c?2 < d\ < 1/2. More- 
over, assume that the distribution functions of the normalized r.v. 's n~ 2dl V\ and q~ 2d2 S22,q 
satisfy the following condition: for any M > 0, oq > there exists a constant K such that 
the inequalities 

P(n~ 2di yi < a) < Ka, P(q~ 2d S 2 2, g < a) < Ka (2.29) 

are satisfied for any n > M, q > M, n/q > M and any < a < a$ . Then there exists a 
constant K%, independent ofn,q,a, and such that 

P(T n < a) < K ia W (i) (dl " d2)/2 , P (f n < a) < (i) (dl ^ )/2 (2 . 30 ) 

hold for all n,q,n/q sufficiently large and any a > from a compact set. 

The proof of the above proposition is given in Section 16.21 Note that condition (|2.29p is 
implied by the existence of bounded probability densities of n~ 2dl V\ and q~ 2d ' 2 S22,q- Also 
note that the assumptions of Proposition 12 . 1 1 1 refer to marginal distributions of V\ and S22,<j 
only, and do not impose a restriction on the joint distribution of the four statistics in the 
definition of T n and T n . 



3 Application to bivariate linear processes with long mem- 
ory. 

In this section we specify Assumptions A(di, c^) and B(di, a\) to a class of bivariate linear 
models (Xi(t),X2(t)), t £ Z as given by 

00 00 
Xi(t) = ^ii(fc)&(t-AO + £Vta(fc)&(t-*), * = 1,2, (3.1) 

fc=0 k=0 

where ipij(k) are real coefficients with X^fcLo ^iji^) < 00 an< ^ ^(t)), t G Z is a bivariate 

(weak) white noise with nondegenerate covariance matrix (p^ij)ij=i t 2- In other words, 



10 



* £ Z is a sequence of random vectors with zero mean E£i(i) = E^(^) = 
and covariances 

mm* - {** ™ 

Without loss of generality, below we shall assume p^n = p^^2 = 1> P(,i2 = P£,2i = Ps, £ 
(-1,1)- 

Assumption A(d{j) (Xi(t), X 2 (t)) is a bivariate linear covariance stationary process as in 
(|3.ip with coefficients t(>ij(k) satisfying the following conditions: 

• if d i:j e (o, 1/2) 

Aj(k) = {a %J +o(l))\k\ d ^ 1 (fc^oc) 
where a™ ^ are some numbers, i,j = 1,2. 



If dij = 



^2\Aj(k)\ < oo, ay = ^2^ij{k). 

k=0 k=0 



Assumption B(djj) Assumption A(dij) is satisfied and, moreover, £2^)), t G Z is a 

sequence of i.i.d. random vectors. 

Proposition 3.1 (i) Let (Xi(t), Xz{€)) satisfy Assumption A(d,ij). Then the limits Cij in 
Assumption A(d\,d2), A2. 5\) exist, with 

di = max{da,^2} e [0,1/2) (i = 1,2). (3.3) 

(ii) Let (X±(t), X2(t)) satisfy Assumption B(dij) and there exists 5 > such that E|£j(i)| 2+<5 < 
oo (i = 1,2). Then (Xi(t),X2(t)) satisfies Assumptions A{d\,d 2 ) and B(d\,d2), with di 
as defined in A3.3\) . Moreover, the finite- dimensional convergence in Assumption B(d\,d2), 
\2.11\) extends to the functional convergence in the Skorohod space D[0, 1]. 

Remark 3.2 Proposition 13.11 (ii) complements Chung (2002, Th.l), who discussed con- 
vergence of partial sums of K— variate linear processes to K— variate fractional Brownian 
motion under slightly different assumptions. Proposition 13.11 (i) and Proposition 12.21 (|2.7|) 
also complement the result in Abadir et al. (2009) about consistency of the HAC estima- 
tor for linear processes, by relaxing the 4th moment condition on the noise in the case 
di € [1/4,1/2). 

Let us consider some parametric examples of bivariate linear processes. Hereafter, we 
denote by L the backward shift i.e. LX{i) = X(t — 1). 
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Example 3.3 Let aij E K (i = 1,2) be some constants, and let 

Xi(t) = (l-L)- d ^aaCi{t) + a i2 Ut)) (» = 1,2) (3.4) 

be FARIMA(0, dj, 0) processes, with possibly different parameters dj G (0,1/2). This pro- 
cess satisfies Assumption A(dij) with djj = dj and aij = r(di) -1 ay = 1,2). 

If (£i(t), £2(2)) form a sequence of i.i.d. vectors as in Assumption B(dy), partial sums of 
(.Xi(i), A 2 (t)) converge to a bivariate fBm (-B li ^ 1 , £? 2 ,d 2 ). 

The limiting fBm has independent components if and only if the noise has uncorre- 
cted components, i.e., if E(an£i(i) + ai 2 £ 2 (i))(a 2 i£i(i) + a 22 £ 2 (i)) = 0. For d x = d 2 , 
the last condition is equivalent to the uncorrelatedness of the components of the process: 
EX 1 (t 1 )X 2 (t 2 ) = 0(t 1 ,t 2 G Z). 

Example 3.4 Consider the following system of linear equations: 

(l-L)**X 1 (t)+P(l-L)**X 2 {t) = 

(1-L)^ 2 (t) = £ 2 (t), 

where d^- G [0, 1/2), /? G R are parameters, d 22 + d' 12 — d' 12 < 1/2 and where i G 

Z are as in (|3.1|) . A covariance stationary solution of the above equation is given by 

X 2 (t) = (l-L)-^ 2 (i), 

Ai(t) = (1 - L)- d 'u£ x (t) - 0(1 - L) d 'r2~ d 'ii- d 22&(t). 

Then (Xi(i), X 2 (i)) satisfies Assumption A(djj) with dn = d' ll5 di 2 = d' 22 +d' u — d' 12 , d 2 i = 
and d 22 = d 22 . 

Assume (£i(t), £2^)) is a sequence of i.i.d. vectors satisfying the conditions in Proposition 
13.11 (ii) and let (3 ^ 0. There are three cases d' 22 > d' 12 , d' 22 < d' 12 and d' 22 = d' 12 leading to 
dn < di2, dn > di 2 and dn = di 2 , respectively. In each of these cases we can determine the 
memory parameter dj of (Xi(t)),i = 1,2 and the limiting bi-fBm in Assumption B(di,d 2 ), 
together with the correlation coefficient p. 

4 A simulation study 

In this section we assess the finite-sample performance of our procedures to test d\ = d 2 
versus d\ > d 2 and provide a practical recommendation for the choice of the bandwidth 
parameter q. 

The memory parameters d\ and d 2 are estimated with the help of the adaptive version 
of the FEXP estimator (see Iouditsky et al. (2001)), which in practice turns out to be 
less sensitive to the short memory part of the long memory process as compared to other 
estimators. The bandwidth parameter is chosen according to the adaptive formula ()6.32l) 
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derived in Appendix (see Section F6 . 3 [) . The optimisation of the bandwidth is realised under 
the null hypothesis in order to ensure a good size to the test procedure. The choice of q 
depends on d = \{d\ + 0^) as in the expansion obtained by Abadir et al. (2009, (2.14)), but 
also takes into account the short memory spectrum, in a form of certain coefficient. 

The simulated samples are independent or dependent Gaussian FARIMA processes with 
different fractional and autoregressive/moving average parameters. The 5% quantile func- 
tion in (|2.22p was approximated from extensive Monte-Carlo experiments by 

h%(d) ~ 3.7d 2 + 8.6d + 5.2. (4.1) 

Independent samples. Table 1-3 concerns the case of independent samples, the test proce- 
dure is based on T n . Tables 1 and 2 report the percentages of rejection of the null hypothesis 
d\ = di of the test T n > t^%{d) from 1000 replications of independent FARIMA(1, d, 0) sam- 
ples of size n £ {1024, 4096}, for five values of di £ {0, .1, .2, .3, .4} and three values 
en £ {0, .4, .8} of the autoregressive parameter. Recall from Proposition 12.61 that for in- 
dependent samples T n and T n have the same limiting distribution. We can see from both 
tables that the T n test has fairly good size for most values of di and Oj. Table 3 provides the 
mean values of q. Since these values are rather scattered across the table and the rejection 
frequency is very sensitive to the choice of q, the general impression from Tables 1-3 is that 
the adaptive choice of q in (|6.32p was necessary. We also note that the power of the test 
decreases with increase of \a\ — a^\. The last fact can possibly be explained by the bias 
induced by the AR part in the FEXP estimator of di. 

Dependent samples. Table 4 reports the performance of the test T n > t§%(d) on dependent 
samples as in Example 3.3, with an = &22 = 1 — P, ai2 = ^21 = P, where p S [0, 1/2) is 
a parameter. In other words, Aj are FARIMA(0,dj,0) processes with mutually correlated 
innovations. The asymptotic correlation coefficient p between normalized partial sums of 
Ai and A2 is proportional to 2p(l — p)/(p 2 + (1 — p) 2 ) and so p increases monotonically 
from to 1 as p increases from to 1/2. Since p = corresponds to independent samples, 
the results in Table 4 can be compared with those for a\ = a>2 = in Tables 1 and 2. It 
appears that both tests T n and T n perform similarly and that the long-run parameter p is 
well estimated to be zero. 

The purpose of Tables 5 and 6 is twofold. Firstly, we want to evaluate the performance 
of T n on independent samples. Secondly, we want to assess the robustness of the adaptive 
formula (|6.32p for bandwidth based on AR approximation of the short memory part with 
respect to other short memory specifications. To this end, we generate a FARIMA(3, d, 0) 
process with polynomial AR function 1 + 0. 7x 3 (Table 5) and a FARIMA(0,<i,2) process with 
polynomial MA function 1 — (l/6)x + (l/6)x 2 (Table 6). Together with the zero hypothesis 
rejection frequencies, Tables 5 and 6 also provide the (averaged) values of the adaptive 
estimator of the bandwidth q. One can immediately recognize that the estimated values of 
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q in Table 5 are much greater than the corresponding values on Table 6; nevertheless the 
size of the T n test is respected in both tables. One can conclude from Table 6 that the 
adaptive formula for q works rather well even if the FAR model (on which this formula is 
based) is misspecified. 
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Table 1: Frequency of rejection (in percentages) of the null hypothesis of the test T n > 
c^%(d). The samples are simulated following FAR(l,cij) models. For fixed a±,a2, each 
cell contains a triangular array of dimension 5x5 corresponding to the different parameters 
(di,d 2 ) with di £ {0, .1, .2, .3, .4} (i=l,2) and d x > d 2 . The sample size is n = 1024. The 
estimation is based on 1000 replications. 
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Table 2: The same results as Table 1 for sample size n = 4096. 
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Table 3: The mean values on 1000 replications of q according to (|6.32p for the simulations 
of Table 2. 
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Table 4: Frequency of rejection (in percentages) of the null hypothesis of the test T n > 
c 5 %(d). The samples are simulated following the model in (|3.4p . For fixed p, each cell 
contains a triangular array of dimension 5x5 corresponding to the different parameters 
(di,d 2 ) with di e {0, .1, .2, .3, .4} (i=l,2) and d 2 < d x . 
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Table 5: [Left] Frequency of rejection (in percentages) of the null hypothesis of the test 
T n > c$%{d). The two samples X\ and X 2 are independent. X± is simulated from 
FARIMA(3,di,0) model with polynomial AR function l+.7x 3 and X 2 from FARIMA(0,d 2 ,0) 
model. [Right] Adaptive estimation of the bandwidth parameter q. The sample size is 4096 
and the statistics are evaluated from 1000 independent replications. 
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Table 6: [Left] Frequency of rejection (in percentages) of the null hypothesis of the test 
T n > c 5 %(d). The two samples Xi and X 2 are independent. Xi is simulated from 
FARIMA(0,di,2) model with polynomial MA function 1 - (l/6)x + (l/6)x 2 and X 2 from 
FARIMA(0,<i2,0) model. [Right] Adaptive estimation of the bandwidth parameter q. The 
sample size is 4096 and the statistics are evaluated from 1000 independent replications. 
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5 Concluding remarks 



The paper constructs a two-sample test for comparison of long memory parameters di G 
[0, 1/2) , z = 1,2 of covariance-stationary time series Xi,i = 1,2 with discrete time. The 
test statistic, T n , is defined as the sum of the ratio and the reciprocal ratio of the rescaled 
variance (V/S) statistics, computed for each sample, whose asymptotic and finite-sample 
behavior was studied in Giraitis et al. (2003, 2006). Under some assumptions which involve 
the existence of long-run covariances and the joint convergence of partial sums of X\ and X 2 
to a bivariate fractional Brownian motion, we derive the asymptotic null distribution of T n . 
A modification T n of the test statistic T n is introduced and shown to be asymptotically free 
of the long-run correlation coefficient between the two samples. The case when (Xi,X 2 ) 
form a bivariate linear process is discussed in detail. Simulation results using FARIMA 
samples with various fractional and autoregressive parameters show that the proposed tests 
have a good size for most values of fractional and autoregressive/moving average parameters. 
The robustness property of the test is largely due to our choice of bandwidth according to 
the adaptive formula in (|6.32p which takes into account the estimated difference of short 
memory spectrum of the sampled processes. The derivation of the last formula uses the 
asymptotic expansion of the HAC estimator in Abadir et al. (2009). 

6 Appendix. Proofs and auxiliary results 

Subsection 6.1 provides alternative definition of bi-fBm by explicit cross-covariance function. 
Subsection 6.2 contains proofs of Propositions 12.21 EH 12.71 12.111 and 13.11 Subsection 6.3 is 
given to the derivation of the adaptive bandwidth formula in (|6.32[) . 

6.1 Covariance function of bivariate fractional Brownian motion 

From (|2.9p and Samorodnitsky and Taqqu (2006) we have for any s, t G M 

EBi(s)Bi(t) = ^{\s\ 2d * +1 + \t\ 2d > +1 - \t - s| 2dl+1 }, i = 1,2. 

The analytic expression of cross-covariance FiBi(s)B 2 (t) is derived in Lavancier et al. (2009). 
It takes a different form in the cases d\ + d 2 7^ and d\ + cfe = 0. Let 



, . B(di + l,d 2 + lWcos^ivr) cos(d 2 7r) 
w{d\.d 2 ) = — , 

s/Bick + 1, di + l)B(d 2 + 1, d 2 + 1) 

Let di + d 2 / (d 1 ,d 2 G (-1/2, 1/2). Then 

EB 1 (s)B 2 (t) = P^{g l2 ( s )\ S \ d ^ +1 + g 21 (t)\t\ d ^ +1 - g 2l (t - s)\t - S |^ +1 },(6.1) 
where 

/ 9ij, t > 0, 

gij{t) = \ 9ji , t<0 
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and where 



5i2 = ip(d 1 ,d 2 )sm(diiT)/ sm((di+d 2 )ir), #21 = ip(di,d 2 )sm(d 2 Tr)/ sin((di+d 2 )ir). (6.2) 
In the case d\ + d 2 = 0, 

Efli( S )fl 2 (t) = ^{^(H + ltl-lt-a]) (6.3) 
+ g 2 (t\og\t\ + s log |s| - (t- s)log\t- s\)\, 

where 

gi = (l/2)^(di,d 2 )(cos(7rdi) + cos(-7rd 2 )), 52 = ip(di, d 2 )(d 2 - d\). (6.4) 
6.2 Proofs of the propositions 

Proof of Proposition 12.21 The first relation in (|2.7p is immediate from (|2.5p and (|2.6p ; 
see also Giraitis et al. (2006, (3.4)). Then, the second relation in (|2.7p follows from (|2.8p . 
which is proved below. 

Assume without loss of generality that /ij = fij = 0. By Assumption A(d±,d 2 ), there 
exists a constant C such that for any n,h > 1, 

e( E Xi ^) ^ Ch 1+2d \ z = 1, 2. (6.5) 

t=n-h+l 

In particular, EXf = 0(n 2dl " 1 ). Let h > 1. Then 

1 n 1 " /? 

t=n—h-\-l t=n—h+l 

Clearly, (j2.8j) follows from g/n — >• and 

<? n 

-1*1 E E = o p {n^-^). (6.6) 

By (|6.5p and Cauchy-Schwarz inequality, 

El^lEl E ^ eV^eV^I 53 Xj (t) 

h=l t=n- 



/i=1 t=n-/i+l 7i=l i=n— 

/ 9 \ 1/2 



since g = o(n). This proves (|6.6p . □ 
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Proof of Proposition 12.61 Both parts (i) and (ii) follow from the joint convergence 

((q/n) 2d 'V l /S ll , q ,(q/n) 2d2 V 2 /S 2 2, q ) ^iaw (^1,^2), (6-7) 

with Ui,U 2 as in (I2,14p , The last relation follows similarly as in Giraitis et al. (2006, 
Lemmas 7.1 and 7.2). □ 

Proof of Proposition [221 (i) Recall from Remark [27J that p = p w . From ([2~7]) . (f2TT7]) 

and Assumptions A(di, d 2 ) and B(di, d 2 ) (with d\ = d 2 = d) it follows that 



-2d q -2d Q 
Q <->11,<j — Q dll,q 



(q- 2d S 12 , q ) 2 
q- 2d S 22 , g 
r 2 

12 r-\ 2\ 

P en = cii(l - pr) 

C22 



and 



„-d-(l/2) 



fdd (V5T(5i(t) - pS 2 (T)), ^B 2 {t)) 



=fdd 



where (-Bi(t) = (B\(t) — pB 2 {r)) / \/l — p 2 ) is a fBm independent of (^(r)); see Remark 
M- Let (£°(t) = ^i(r)-rSi(l),r G [0,1]), (fl°(r) = B 2 (t) - tB 2 (1), t G [0,1]) be 
respective fractional Brownian bridges. 

These relations together with (|2.7|) imply similarly as in Giraitis et al. (2006) that 

(6.8) 



n- 2d V x 


-h. 


n- 2d V 2 


-h. 


- 2d Su, g 




~ 2d S22, q 





aw 



aw 



C22 



J\bI{t)) 2 &t-(J\1(t)&t) 2 ^ , (6.9) 



,2\ 



(6.10) 

C22 (6.11) 



as n, q, n/q — > 00, as well as the joint convergence of the four quantities in (|6.8[) - f|6.11|) . Since 
the limits in (j6.8[) - (j6.11[) are a.s. strictly positive and (f?i(r)) is independent of (B 2 (t)), 
this proves (I2.19|) and part (i). 

(ii) From ([277]), ([2717)) we have 



q- 2d *S 22 , q 



c 2 

> P cn-^ 2 - = cn(l-/> 2 ). (6.12) 

C22 
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From (|2.7p and Assumption ~B(d\,d2) we obtain that 



^fdd VHir^i(r). (6.13) 
Using similar arguments as in part (i), from (|6.12|) and (|6.13|) we get 

x2fdi-d2 ^l/^ll,g ^/„^2(d 2 -da)I^/^22 l£ ^ f gi (1 - p 2 )U 2 



\ ^2/522,5 Vl/S n ,qJ V C 1 ~ P ) U 2 U\ J 

where Ui,i = 1,2 are defined in ([213)1 . Clearly, ([613)1 implies ([230)1 and part (ii). 
(iii) In this case, ()6. 12[) is again valid but f)6. 13|) must be changed to 

rl d2-di [nr] 

^(X 1 (t)-X 1 ) = 



n d 2 +l/' ^ | 



/C22 

From (|6.15p we obtain 



> fdd -^=^2(r). (6.15) 



n 2(d 2 -d 1 ) / fl .rl \ 2\ 

-Vi ^ law en/) 2 Mf ( J B 2 °(r)) 2 dr - ( jf B 2 (r)dr) J 



2d 2 



Combining this result with ([6. 12[) and the convergences in (j6.9|) . ([6. lip , with d = d,2, one 
obtains 

Vi/Sn, g ^ 
V 2 /S 2 2, q law 1-P 2 ' 
proving (HOT)) . □ 

Proof of Proposition 12.111 We shall prove the second inequality in (j2.30p only since the 
first one can be proved analogously. We shall use the following elementary inequalities: for 
any r.v. £, rj > and any x > 

P(- < x) < P(£ < y/x) +P(r? > 4=). ( 6 - 16 ) 
P(f ?7 < x) < P(f < v^) + P(v < Vac), P(f - r? < s) < P(f < 2s) + P(r? > x). 

Denote 
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Then 



P(f n <o) < p(^-<x) (6.17) 



-l/2x 



< P(fl < X 1 / 4 ) + P(£ 2 < X 1 / 4 ) + P(| 3 > X- 1 / 4 ) + P(£ 4 > X 

32 T 7 _ (-\ /our _ P 2 Su, q 



-1/4a 



Next, using the inequality Vi > (1/2) Vi - /3 2 V2 = (1/2) Vi - f ^ li1 ^ and the facts that 
\p\ <1,V 2 > 0, we get 

6 > (1/2)6 -m, (6-i8) 

where 

'11,9/ -2d, t/n _ 



'-'22, g 52 

Relations (|6TT8jl and (IdTTHI) yield 

P(£i<z 1/4 ) < P((l/2)6-r?i<x 1 / 4 ) (6.19) 

< P(£i < 4x 4 / 4 ) + P( m > x 1 / 4 ) 

= P(& < 4X 1 / 4 ) + P(x^ > x 1 / 4 ) 

?2 

< P(6 < 4X 1 / 4 ) + P(^ 3 > x" 1 / 4 ) + P(£ 4 > £- 1/4 ) + P(6 < x 1 / 4 ). 
Combining (|6.17p and (|6.19j) and using Sn t q > Sii^, see (|2.17p . we obtain 

P(T„, < a) < P(a < 4X 1 / 4 ) + 2P(£ 2 < x 1 / 4 ) + 2P(6 > x" 1 / 4 ) + 2P(£ 4 > aT 1 ^^) 
From Assumption A(di,d2) and (|6.30p we obtain 

E^ii,9 = ^E(^X 1 (t)) 2 + E^(l--^L)(5ii(fc)-^ 1 (fc)) (6.21) 



< #2 



t=0 |/i|<g 

7 2di 
2d 2 



EV2 < if 3 n 

for some constants K2, K3 independent of n, q, implying E^3 < if 2, E£ 4 < K%. From (16.20p . 
(|6.2ip . the Markov inequality, and assumption (|2.29p . the statement of the proposition easily 
follows. □ 

Proof of Proposition 13.11 With exception of (|2.6p . all other facts in the statement 
of the proposition follow similarly or using the argument developed in Davydov (1970), 
Bruzaite and Vaiciulis (2005), Giraitis et al. (2006) and other papers. In particular, the 
joint convergence of partial sums of (X\,X2) can be proved by using the scheme of discrete 
stochastic integrals in Surgailis (2003). See also Chung (2002, proof of Theorem 1). 
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Let us prove the convergence of empirical long-run covariances in (|2.6p or, equivalently, 
in (HTTP. Denote 



XijV) = *,J = 1,2 (6.22) 

fe=0 

It suffices to show the convergence of the HAC estimates of long-run covariances c^yy 
of components Jfy and Xiy in (|6.22p . for any pairs (i,j),(i',f) G {1,2} x {1,2}; more 
precisely, to show that 

<Z '' ^ P c^f (i,j,i',j' = 1,2), (6.23) 

where is defined as in (|2.1j) with Jij(h) replaced by the empirical covariance Jij,i'j'(h) 

between observations Xy (t), i = 1, • • • , n and X^y (t), t = 1, ■ ■ ■ , n. 

Fix i,j,i',f G {1,2} and denote X(t) = Xij(t), X'(t) = Xi'f(t), d = dij, d' = dyf, 



ij){k) = Tpij(k), Tp'(k) = ipi'f(k), £(s) = £'(s) = £f(s), p ? = p Uj > for short. Write 

S q = S' q + S'' where 



h=—q h=—q 

where 



7(/i) = n 



is the empirical covariance from noncentered observations; c.f. ([2.2p , Then (|6.23p follows 
from 

q- d - d 'S' q ^ p c, S' q ' = o p (q d+d '). (6.24) 
In the subsequent proof of the first relation of ()6.24p . we first assume d > 0, d! > 0. Split 



= 7(0) + Efel-g (l - J+t) 7(^) + E/Ll f 1 - g+l) 7W- Here > the last tw ° SUmS Can be 

treated similarly and 7(0) = O p {l) is negligible. Consider 

h=l v H 7 i=l 



where 



* = «EE f 1 -^! j-E^-*)^*-*). 

<J , Ji \ 1 n_/l 

^ = E^E i-rxr -E^-^'(*+^-^ 



(6.25) 

^ = EE^-^l-E^-'y^+fc-oew^c-o. (6.26) 
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where tj s = £(s)£'(s) — E£(s)£'(s) = £(s)£'(s) — p^, the sums and are taken over 

all s G Z and s,s'sZ,s^ s', respectively, and where we put ^(i) = ^'(t) = (t < 0). 

First, consider the (nonrandom) term Lq. Using the asymptotics of ip and ^' and the 
dominated convergence theorem, we easily obtain that, as q — > oo, n — > oo, q/n — > 0, 

n—h oo 

ip(k)^'(h + fc) 



ft=l v 4 7 t=l fc=0 

h =i jo 



k d - 1 {h + k) d - 1 dk 

~ ^aa'B^l-d-d')^^ 1 

/i=l 

where c = 2p£ a a' B(<i, 1 — d — d')/(d + d'). Then, the first relation in (I6.24p follows from 

g-*-*^ = o p {\) (i = 2,3). (6.27) 

To estimate C/2, we use the fact that (rj(s), s G Z) are i.i.d.r.v.'s, the well-known inequality 
E| X^i — 2 X^i E|Mj| p for independent zero mean random variables Mi with E|Mj| p < 00 
and 1 < p < 2 (see Bahr and Esseen (1965)), the fact E|r/ S | p = C p < 00 for some p G (1, 2) 
and the Minkowski inequality. Using these facts, we obtain 

E^E(Ei^ 

(1 00 1 / \ p 

E(E s f- 1) (^+^ (d '- 1) ) ) 

< c(^(^ +d '- 2 ) +1 ) 1/p ) P 

<? 

< c^^/ l (rf+rf'-2)+(l/ P )^ P < CqP ((l/ P )+d+d'-l) 

h=l 

and therefore E 1 / p |t7" 2 | p = 0(g d+d ' + ( 1 / p )- 1 ) = o(g d+d '), as p > 1, proving (|B27)) for i = 2. 
Next, consider f7 3 . Using the fact that £~ 1 s rf ~ 1 (*+s) d '~ 1 < Ct d+d ' _1 (i > 0), we obtain 

/ q / h \ 1 n_fe N 2 

E ^ 2 < 2E(E( 1 -^tt)^E^-^+ /i - s/ )) 
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c n q 



^ ^E E EWt-^ + fc-^t'-^ + h'-*')! (6.28) 

t,f=l h,h'=l s,s' 

< E \t-t'\r i \t-t'+h- h '\^ 

t,t'=l h,h'=l 

< — E £i*i+~ 1 i*+*i+'~ 1 ^ c ( J i + J 2 



where 



|t|<n \h\<q 



h = (ff/n) £ Itir 1 £ l^f- 1 < C(q/ntfM = (gW)), 
|t|<2g N<3g 

J 2 = tf/ n ) \t\ 2d+2d '- 2 < C(q 2 /n)< 

2q<\t\<n 



q 2d+2d'~l^ d + d'< 1/2 

Uog(»/g), d+tf = 1/2 

and so J 2 = o(q 2 ( d+d )) as g, n, n/g — >• 00 in all three cases (in the last case d + d' = 1/2 this 
follows from the fact that x — )• entails xlog(l/x) — > 0). 

This proves the first relation of (|6.24[) for d > 0, cf > 0. 

Consider now the case d = d! = we want to prove that 



S q — >p C, 



where 



c = i^ rt (E^))(E^)) 

n n 

= p 6 lim )i _1 VV)/i({-m)Vi/)'(s-m) 

n— >oo * — » * — » ' — » 

= p 5 aa'. (6.29) 

By writing S' q = ES*^ + (S' q — ES 1 '), the convergence ES^ — > c follows similarly as in (|6.29|) 
above. Relation S' q — ES' q = o p (l) can be shown similarly to (|6.27p . i.e., by splitting S' q — ES' q 
into "diagonal" and "off-diagonal" parts in the quadratic form in noise variables. Consider 
the "diagonal" part U 2 in (JQg]> . Then 

E|C/ 2 | p < 2<7 p W^if>(t- S )V/(i + / i - S )|Y ^ C(W 1 + W 2 ), 

s ^h=l 71 t=l ' 

where 

n , q 1 n 



- / _ 1 " \ p 

^ = E E-Ei#- s )^+' i - s )i 

S=l ^/l=l t=l 7 

n / 00 \ p 

< CrT^ K>(i- S )| = O(n^) 

s=l ^ t=l 7 



0(1) 
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since p > 1, while 



oo , q n 

w * = E E-Ei^ + ^ + ^+^i 

s=0 ^ft=l t=l ' 
oo / n \ p 

s=0 ^ i=l ' 

/ n / oo \ l/p\ p 

\ t=l ^ s=0 ' ' 



where we used the Minkowski inequality and the dominated convergence theorem to get 
o(l), in view of the fact that I'M* + S )\ P * s bounded in t and tends to zero as t — > 00. 

Consider the "off-diagonal" term U3 in (|6.26|) . Noting that, for fixed h,h', the sum 
in (|6.28p over all t,t',s,s' G Z is bounded by a constant independent of h,h', we get 
Et/f < C(q/n) 2 = o(l). 

This proves the first relation of (|6.24[) for d > 0, d! > and (i = d' = 0. 

Let us prove the second relation in (|6.24|) . It follows from 

Y,nm-m\ = o(q d+d '). (6.30) 

\h\<q 

Using definitions of 7(/i),7(/i), the Cauchy-Schwartz inequality and (|2.5p . for /i > one 
obtains 

(E\m-m\) 2 

n— ft s 2 / 1 n— ft \ 2 

± + +E(X') 2 E i^X(t) +E(X) 2 E(X0 2 

< Cn 2d+2d '- 2 (6.31) 

and so (|6.30p reduces to Cqn d+d ~ l = o(q d+d ') which is a consequence of d + <f < 1 and 
<2</n ->■ 0. 

This concludes the proof of (|6.24p for d > 0, d! > and d = d' = 0. The cases d > = d! 
and d = < d 1 can be treated in a similar way. Proposition 13.11 is proved. □ 

6.3 Derivation of the adaptive bandwidth 

The aim of this section is to derive the adaptive bandwidth formula used in our simulations, 
viz. 

q = O.^P^' ifJ<1 /4> (6.32) 
\n x /2-d, if d > 1/4, 

where d = {d\ + d,2)/2 is an estimator of the (common) long memory parameter d, 

i = r _ b<zh , d * — , (6 .33) 

7o W(0) 52(0)/ x 2d sin 2 (x/2) 
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and where gi is an estimator of the short memory part g%(x) = fi(x)/\x\ 2di of the spectral 
density /j of Xj, % = 1, 2. In this paper, is the spectral density of the best AR approxi- 
mation of gi which is computed following the two step procedure in Ray and Crato (1996). 
Namely, we first estimate dj and then we fit an AR process to (1 — L) di Xi using the BIC 
criterion. 

From Abadir et al. (2009, Theorem 2.1) under similar assumptions on Xi as in Section [2] 
we have the following expansion of Su q'. for < dj < 1/4, 

q- 2dl S ii!g = Cii + (g/n) 1 / 2 5i (0)(Z ni + op(l)) + ^"^(OXA + o p (l)), (6.34) 

where Z ni -^ aw Zj ~ iV(0, v(dj)), 

u(di) = 8tt / — -ttt^) x l dx, 
Jo V x/2 / 

5i(x) 1 \ dx 



^ 7o y gi (0)sm 2 (x/2) 1{0<x<7T} {x/2f)x^ 

and where = fi(x)\x\ 2di is the short memory component of the spectral density fa of 

Xi, which is assumed to be continuous at x = and gi{x) = gi(0) + O(x 2 ), x — > 0, pi(0) > 0. 
Note that the long-run variance c M - is related to <?i(0) by 

Cm = gi(0)p(di), (6.35) 

where p(d) = 2r(l — 2d) sin(7rd)/d(l + 2d) depends only on d. 

From the form of statistic T n it is clear that q must be chosen so that the ratio cwjcii 
is well estimated by S\\ A ) ' S%2,q- From (|6.34p . assuming d\ = di = d as under the null 
hypothesis, we obtain 

f^ii-1 = (? /„)V2^^ (1 + 0p(1)) + g -i- M ^^ (1 + 0p(1)) . (6.36) 

A22,g/C22 P(dj P(«) 

Therefore as n, q,q/n — >■ oo, 

^(f^-l) 2 ~ -^((g/nJE^-^ + g- 2 ^! 9 ), (6.37) 

^22,g/C22 ' H«) V 7 

since for d\ = d<i = d, we have B\ — B2 = /, where 

j r / giQc) _ g2(^) \ 

" 7o Wl(0) ^W^sm 2 ^)' 
c.f. (j6.33p . Minimizing the right-hand side of (16.370 with respect to q, we obtain 

g = K 1 (d)\I\ 2 K 3 ^n 1 K 3+4d >, (6.38) 

where K\(d) depends on d and E(Zi — Z2) . Numerical computation of the function ifi(d) 
reveals that it is well approximated by the constant value 0.3 on the interval (0, 1/4) except 
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for the case when d is close to 1/4 and then K\(d) diverges but then also the approximations 
in ()6.34p and (|6.37p are less accurate. Therefore we choose to replace K\{d) in ()6.38p by 
0.3 on the whole interval d 6 (0, 1/4) in order that the test is not too conservative. For 
similar reasons, we replace the exponent of |/| in (|6.38[) by 1/2, since otherwise the test 
turns out to be too conservative for small values of d when (X\) and (X2) have very 
different short memory parts (or high values of |/|). The result of these replacements is 
q = 0.3\I\ 1 / 2 n 1 /( 3+M \ c.f. (|Q2]> . 

Next, let us turn to the case 1/4 < d < 1/2. From Abadir et al. (2009, Theorem 2.1) we 
obtain that for 1/4 < di < 1/2 

q- 2di S° i>q = cu + (q/n) l - 2di 9i (0)(Z ni + 0p (l)) + ^-^(O)^ + o p (l)), (6.39) 

where Z n i — >\ sm Z and Z{ is a (non-Gaussian) r.v. whose distribution depends only on di. 
From Proposition 2.4 (|2.8p we have that 



q- 2di (Sii, q -S° iiq ) = -liq/n^gMiY'+o^l)), (6.40) 

where 

Ym = gm-^n^-^Xi ^ law Yt ~ n(o, -^-) = N(0,p(di)), (6.41) 

see (j6735l> . Combining ([6T39|) - ([674T]) and using the facts that EZ, = 0, i = 1,2 and Elf = 
Elf, similarly as in (|6.36p and (|6,37p we obtain 

= ( q /nf- 2d Znl - ^ "f" 2 + 2y " 2 )(l + o p (l)) 

S22,q/C22 P[d) 

and 

E (SnJcn_ i y „ lf {q/n) 2(i-2 d ) EJ 2 {d) + q -2 { i + 2 d)l 2\ {6A2) 

^S 2 2,q/C22 > V\P) V 1 

where J{d) = Z\ — 2Y 2 — Z2 + 2lf has a distribution depending on d alone. Minimization 
of the right-hand side of (|6.42p with respect to q leads to 

q = K 2 {d)\I\ l/2 n 1 / 2 - d , (6.43) 

where K2{d) is a function of d. In this case, we also choose ^(d) = 0.3 for similar reasons 
as in the case d < 1/4 above. This completes our derivation of the bandwidth formula 
([Q2]) . 
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